Finding an Upper Limit in the Presence of Unknown Background 
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Experimenters report an upper limit if the signal they are trying to detect is non-existent or 
below their experiment's sensitivity. Such experiments may be contaminated with a background too 
poorly understood to subtract. If the background is distributed differently in some parameter from 
the expected signal, it is possible to take advantage of this difference to get a stronger limit than 
would be possible if the difference in distribution were ignored. We discuss the "maximum gap" 
method, which finds the best gap between events for setting an upper limit, and generalize to the 
"optimum interval" method, which uses intervals with especially few events. These methods, which 
apply to the case of relatively small backgrounds, do not use binning, are relatively insensitive to 
cuts on the range of the parameter, are parameter independent (i.e., do not change when a one-one 
change of variables is made), and provide true, though possibly conservative, classical one-sided 
confidence intervals. 

PACS numbers: 06.20.Dk, 14.80.-j, 14.80.Ly, 95.35.+d 



I. INTRODUCTION 



Suppose we have an experiment whose events are dis- 
tributed along a one-dimensional interval. The events 
are produced by a process for which the expected shape 
of the event distribution is known, but with an unknown 
normalization. In addition to the signal, there may also 
be a background whose expectation value per unit in- 
terval is known, but one cannot completely exclude the 
possibility of an additional background whose expecta- 
tion value per unit interval is non-negative, but is other- 
wise unknown. If the experimenters cannot exclude the 
possibility that the unknown background is large enough 
to account for all the events, they can only report an up- 
per limit on the signal. Even experimenters who think 
they understand a background well enough to subtract 
it may wish to allow for the possibility that they are 
mistaken by also presenting results without subtraction. 
Methods based on likelihood, such as the approach of 
Feldman-Cousins jjj, or Bayesian analysis, cannot be ap- 
plied because the likelihood associated with an unknown 
background is unknown. An example of this situation is 
analysis of an experiment which tries to detect recoil en- 
ergies, Erccoii, deposited by WIMPs bouncing off atoms 
in a detector. For a given WIMP mass, and assumed 
WIMP velocity distribution, the shape of the distribu- 
tion in Erccoii can be computed, but the WIMP cross 
section is unknown, and it is hard to be certain that all 
backgrounds are understood. The simplest way of deal- 
ing with such a situation is to pick an interval in, say, 
-E'recoil) an d take as upper limit the largest cross section 
that would have a significant probability, say 10%, of 
giving as few events as were observed, assuming all ob- 
served events were from WIMPs. One problem with this 
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naive method is that it can be very sensitive to the in- 
terval chosen. It is typical for the bottom of a detector's 
range of sensitivity to be limited by noise or other back- 
grounds. Thus if the interval extends to especially low 
-E'recoili there will be many events, leading to a weaker 
(higher) upper limit than is required by the data. On 
the other hand, experimenters could inadvertently bias 
the result by choosing the interval's endpoints to give es- 
pecially few events, with an upper limit that is lower than 
is justified by the data. In order to avoid such a bias, it 
might be thought best to avoid using the observed events 
to select the interval used. But the procedures discussed 
here take the opposite approach. The range is carefully 
chosen to include especially few events compared with 
the number expected from a signal. The way the range 
is chosen makes the procedure especially insensitive to 
unknown background, which tends to be most harmful 
where there are especially many events compared with 
the number expected from a signal. It would be a mis- 
take to compute the upper limit as if the interval were 
selected without using the data; so the computation is 
designed to be correct for the way the data are used. 

While the methods described here cannot be used to 
identify a positive detection, they are appropriate for 
obtaining upper limits from experiments whose back- 
grounds are very low, but non-zero. These methods have 
been used by the CDMS experiment . 



II. MAXIMUM GAP METHOD 

Figure illustrates the maximum gap method. Small 
rectangles along the horizontal axis represent events, 
with position on the horizontal axis representing some 
measured parameter, say "energy" , E. The curve shows 
the event spectrum, dN/dE, expected from a proposed 
cross section, a. If there is a completely known back- 
ground, it is included in dN/dE. But whether or not 
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FIG. 1: Illustration of the maximum gap method. The horizontal axis is some parameter, "E" , measured for each event. The 
smooth curve is the signal expected for the proposed cross section, including any known background. The events from signal, 
known background, and unknown background are the small rectangles along the horizontal axis. The integral of the signal 
between two events is "x". 



there is a completely known background, we assume there 
is also an unknown background contaminating the data. 
To set an upper limit, we vary the proposed size of a 
until it is just high enough to be rejected as being too 
high. We seek a criterion for deciding if a proposed sig- 
nal is too high. Since there are especially many events 
at low E, while dN/dE is not especially high there, those 
events must be mostly from the unknown background. If 
we only looked at the low energy part of the data, we 
would have to set an especially weak (high) upper limit. 
To find the strongest (lowest) possible upper limit, we 
should look at energies where there aren't many events, 
and therefore isn't much background. 

Between any two events, E\ and E- l+ \, there is a gap. 
For a given value of a, the "size" of the gap can be char- 
acterized by the value within the gap of the expected 
number of events, 

f Ei+1 dN 

The "maximum gap" is the one with the greatest "size" ; 



it is the largest of all the X{. The bigger we assume a to 
be, the bigger will be the size of the maximum gap in the 
observed event distribution. If we want, we can choose o 
so large that there are millions of events expected in the 
maximum gap. But such a large a would be experimen- 
tally excluded, for unless a mistake has been made, it is 
almost impossible to find zero events where millions are 
expected. To express this idea in a less extreme form, a 
particular choice of <r should be rejected as too large if, 
with that choice of a, there is a gap between adjacent 
events with "too many" expected events. The criterion 
for "too many" is that if the choice of a were correct, a 
random experiment would almost always give fewer ex- 
pected events in its maximum gap. Call x the size of 
the maximum gap in the random experiment. If the ran- 
dom x is lower than the observed maximum gap size with 
probability Co, the assumed value of a is rejected as too 
high with confidence level Cq. Since x is unchanged un- 
der a one-one transformation of the variable in which 
events are distributed, one may make a transformation 
at a point from whatever variable is used, say E, to a 
variable equal to the total number of events expected in 
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the interval between the point and the lowest allowed 
value of E. No matter how events were expected to be 
distributed in the original variable, in the new variable 
they are distributed uniformly with unit density. Thus 
any event distribution is equivalent to a uniform distri- 
bution of unit density. The probability distribution of x 
depends on the total length of this uniform unit density 
distribution, and in this new variable the total length of 
the distribution is equal to the total expected number of 
events, Li, but it does not depend on the shape of the 
original event distribution. Co, the probability of the 
maximum gap size being smaller than a particular value 
of x, is a function only of x and fi: 



^ (kx - fi) k e- 



-kx 



k=0 



k\ 



k 



kx 



(2) 



where m is the greatest integer < fx/x. For a 90% con- 
fidence level upper limit, increase a until fi and the ob- 
served x are such that Co reaches 0.90. 

Equation ^ can be evaluated relatively quickly when 
Co is near 0.9. When li is small, so is m, and when li 
is large, the series can be truncated at relatively small k 
without making a significant error. Equation^ is derived 
in Appendix A. 

While this method can be used with an arbitrary num- 
ber of events in the data, it is most appropriate when 
there are only a few events in the part of the range 
that seems relatively free of background (small li) . The 
method is not dependent on a choice for binning because 
unbinned data are used. No Monte Carlo computation of 
the confidence level is needed because the same formula 
for Co applies independent of the functional form for the 
shape of the expected event distribution. The result is a 
conservative upper limit that is not too badly weakened 
by a large unknown background in part of the region un- 
der consideration; the method effectively excludes regions 
where a large unknown background causes events to be 
too close together for the maximum gap to be there. 



III. OPTIMUM INTERVAL METHOD 

If there is a relatively high density of events in the data, 
we may want to replace the "maximum gap" method by 
one in which we consider, for example, the "maximum" 
interval over which there is 1 event observed, or 2 events, 
or n events, instead of the zero events in a gap. 

Define C n (x, li) to be the probability, for a given cross 
section without background, that all intervals with < n 
events have their expected number of events < x. As for 
Co of the maximum gap method, so long as x and /i are 
fixed, C„ is independent of the shape of the cross sec- 
tion and the parameter in which events are distributed. 
But C n (x,Li) increases when x increases, and it increases 
when n decreases. C n can be tabulated with the help of a 
Monte Carlo program, although the special case of n = 
can be more accurately computed with Eq. 0. Once n is 
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FIG. 2: Plot of C , Max(-9, aOi the value of CMax for which the 
90% confidence level is reached, as a function of the total 
number of events fi expected in the experimental range. 



chosen, C n can be used in the same way as Co for obtain- 
ing an upper limit: for x equal to the maximum expected 
number of events taken over all intervals with < n events, 
C n (x,fi) is the confidence level with which the assumed 
cross section is excluded as being too high. But since we 
do not want to allow n to be chosen in a way that skews 
results to conform with our prejudices, the optimum gap 
method includes automatic selection of which n to use. 

For each interval within the total range of an actual 
experiment, compute C n (x,fi) for the observed number 
of events, n, and expected number of events, x, in the 
interval. The bigger C n is, the stronger will be the evi- 
dence that the assumed cross section is too high. Thus 
for each possible interval, one may quantify how strongly 
the proposed cross section is excluded by the data. The 
"optimum interval" is the interval that most strongly in- 
dicates that the proposed cross section is too high. The 
optimum interval tends to be one in which the unknown 
background is especially small. The overall test quan- 
tity used for finding an upper limit on the cross section 
is then Cviax, the maximum over all possible intervals of 
C n (x, li). A 90% confidence level upper limit on the cross 
section is one for which the observed Cyiax is higher than 
would be expected from 90% of random experiments with 
that cross section and no unknown background. 

The definition of Cm & x seems to imply that its determi- 
nation requires checking an infinite number of intervals. 
But given any interval with n events, x, hence C n (x,fj,), 
can be increased without increasing n by expanding the 
interval until it almost hits either another event or an 
endpoint of the total experimental range. For determi- 
nation of Cyiax one need only consider intervals that are 
terminated by an event or by an endpoint of the total ex- 
perimental range. If the experiment has N events, then 
there are (N + 1)(N + 2)/2 such intervals, one of which 

has C n (x,fi) = C M ax- 

The function Cmzx.(C, fi) is defined to be the value 
such that fraction C of random experiments with that 
fi, and no unknown background, will give CMax < 
CMax(C, fx). Thus the 90% confidence level upper limit 
on the cross section is where CMax of the experiment 
equals CWx(-9, fJ>), which is plotted in Fig. 0. 
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FIG. 3: <TMed/o"True, the typical factor by which the upper 
limit cross section exceeds the true cross section, when Co is 
used (dotted lines), when pMax is used (dash-dotted lines), 
when Civiax is used (dashed lines), and when the Poisson 
method is used (solid lines). These ratios are a function of 
fi, the total number of events expected from the true cross 
section in the entire experimental range. For the upper figure 
(a) there is no background, and for the lower figure (b) there 
is just as much unknown background as there is signal, but 
the background is concentrated in a part of the experimental 
range that contains only half the total signal. 



A Monte Carlo program was used to tabulate C n (x, fi). 
A Fortran routine interpolates the table to compute 
C n [x,jjL) when n, x, and fi are within the tabulated 
range. The routine applies when < fx < 54.5 and when 
< n < 50. 

The function, CMax(C fi), has been computed by 
Monte Carlo and tabulated for fi < 54.5 and various 
C. Certain peculiarities of this function are discussed in 
Appendix B. 

Routines to evaluate functions described in this paper, 
along with tables they use, are available on the web ||. 



IV. COMPARISONS OF THE METHODS 

We compare the maximum gap (Co) and optimum in- 
terval (Cviax) methods with each other, with the stan- 
dard [EJ way of finding an upper limit ("Poisson"), and 
with another method (pMax) described in Appendix C. 

The standard "Poisson" confidence level C upper limit 
cross section is the one whose fi would result in fraction C 
of random experiments having more events in the entire 
experimental range than the n actually observed. This 



fraction C is 

°° ,.k rfJ. j-n 

Pfon + 1)= J2 1J^ = / dt -^- ( 3 ) 

/c=n+l ' J ° 

The last equality is proved by observing that both sides 
have the same derivative, and they have the same value 
at n = 0. P(x,a), the incomplete Gamma function, is 
in CERNLIB g) as GAPNC(a,x), DGAPNC(a,x), and 
GAMDIS(x,a). 

The description of the PMax method is relegated to 
Appendix C because although pMax is somewhat easier 
to implement than CMax, it was found to be less powerful. 

Two comparisons of the effectiveness of the methods 
were performed: tests "(a)" and "(b)". For test (a), 
500, 000 zero-background Monte Carlo experiments were 
generated for each of 40 assumed cross sections. Co, 
PMax 7 CMax, and the Poisson method were used to find 
the 90% confidence level upper limits on the cross sec- 
tion. For a given true cross section, CTTrue, there is a cer- 
tain median value, <7Med, that is exceeded exactly 50% of 
the time by the computed upper limit. Fig. |^(a) shows 
CMed/cTruo as a function of ft. The dotted curve used Co 
to determine the upper limit, the dash-dotted curve used 
PMax, the dashed one used CMax, and the solid, jagged, 
curve used the Poisson method. The Poisson method 
gives a jagged curve because of the discrete nature of the 
variable used to calculate the upper limit, the total num- 
ber of detected events. For any cross section shape, when 
there is no background, CMax gives a stronger limit than 
PMax in most random experiments, and both are stronger 
than Co- Even without background, for some values of 
the true fi, Cm & x gives a stronger (lower) upper limit than 
the Poisson method. This happens because the discrete 
nature of the Poisson method causes it to have greater 
than 90% coverage. 

Although test (a) is presented as a comparison of meth- 
ods in the absence of background, it can also be consid- 
ered to be a comparison of methods when the background 
is distributed the same as the signal. If the unknown 
background happens to have the same distribution as the 
signal would have, essentially no sensitivity is lost by us- 
ing the optimum interval method with CMax instead of 
the Poisson method. 

Test (b) was similar to test (a), but the Monte Carlo 
program simulated a background unknown to the exper- 
imenters, and distributed differently from the expected 
signal. The total experimental region was split into a 
high part and a low part, with background only in the 
low part. Half the expected signal was placed in the low 
part, where the simulated background was twice the ex- 
pected signal. For this case, the two lowest curves are 
almost exactly on top of each other; Fig. ||(b) shows that 
CMax and PMax get equally strong upper limits. Co pro- 
duces a weaker limit, and the Poisson method is weakest 
of all. 

From the definition of the 90% confidence level upper 
limit, test (a) results in an upper limit that is lower than 
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FIG. 4: Fraction of cases for test (b) (see text) in which the 
true cross section was higher than the upper limit on the cross 
section computed using Co (dotted), pMax (dash-dotted) and 
C M ax (dashed). 



the true value exactly 10% of the time; i.e., all methods 
except the Poisson make a mistake 10% of the time (the 
discrete nature of the Poisson distribution results in its 
making mistakes less than 10% of the time). But for test 
(b), the unknown background raises the upper limit; so 
all methods make a mistake less than 10% of the time. 
Figure shows the fraction of mistakes with test (b) us- 
ing C (dotted), PMax (dash-dotted) and C*Max (dashed). 
Although CMax and PMax give equally strong upper lim- 
its for test (b), CMax makes fewer mistakes. Co makes 
the most mistakes of the tested methods. Not shown is 
the Poisson method; because its upper limit is so high, it 
makes almost no mistakes. 



V. CONCLUSIONS 

Judging from the tests shown in Fig. || and Fig. ||, 
the best of the methods discussed here is the optimum 
interval method, with CMax- This method is useful for 
experiments with small numbers of events when it is not 
possible to make an accurate model of the background, 
and it can also be used when experimenters want to show 
an especially reliable upper limit that doesn't depend 
on trusting their ability to model the background. Be- 
cause the optimum interval method automatically avoids 
parts of the data range in which there are large back- 
grounds, it is relatively insensitive to placement of the 
cuts limiting the experimental range. Because the op- 
timum interval method doesn't use binned data, it can- 
not be biased by how experimenters choose to bin their 
data. Unlike Bayesian upper limits with a uniform prior, 
the result of the optimum interval method is unchanged 
when a change in variable is made. The optimum interval 
method produces a true, though possibly conservative, 
classical (frequentist) confidence interval; at least 90% of 
the time the method is used its 90% confidence level up- 
per limit will be correct, barring experimental systematic 
errors. 
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APPENDIX A: DERIVATION OF THE 
EQUATION FOR C 

In order to derive Eq. |^, let us first find the probability 
that the maximum gap size is less than x when there 
are exactly n events, then get Co by averaging n over a 
Poisson distribution. 

We assume n events are distributed in some variable, 
?/, according to a density distribution that integrates to 
a total of fj, expected events, and define P(x; n, /i) to be 
the probability that the maximum gap size is less than 
x. As explained in Section II, one may make a change 
of variables to z(y) such that the density distribution is 
uniform over < z < /i. P(x; n, fi) is the probability 
that the maximum z coordinate distance between adja- 
cent events is less than x given that there are exactly 
n events distributed randomly, independently, and uni- 
formly between z — and z = fj,. The function P de- 
pends only on x, n, and fi, but not on the shape of the 
original density distribution. 

The problem of finding P(x; n, /i) can be simplified by 
making a coordinate change w(z) = zj\x. The new co- 
ordinate runs from to 1 instead of to fj,. With this 
coordinate change, any set of n events with x equal to the 
maximum gap between adjacent events becomes a set of 
n events, still uniformly distributed, but with maximum 
new coordinate distance between adjacent events equal 
to x/fi. It follows that P(cc//i; n, 1) = P(x; n, fi), and we 
need only solve the problem of finding P for \i = 1 to get 
the solution for any value of /i. When [i is understood to 
be 1, it will be dropped, and we will write P(x;n) to mean 
the same as P(x;n, 1). The problem has been reduced 
to one in which n points have been scattered randomly 
in independent uniform probability distributions on the 
interval (0, 1). We want to find the probability that the 
maximum empty interval has length less than x. We do 
this with the help of a recursion relation that allows one 
to compute P(x; n + 1) from knowledge of P(x; n). 

P[x; n+ 1) is the integral over t < x of the probability 
that the lowest event is between t and t + dt and that the 
rest of the n events in the remaining 1-t range has no gap 
greater than x. The probability that the lowest event is 
between t and t + dt is (number of ways of choosing one 
particular event of the n + 1 events) times (probability 
that the particular event will be between t and t + dt) 
times (probability that each of the other n events will 
be greater than t). We get a factor in the integrand 
(n + 1) X dt x (1 — t) n . The other factor in the integrand 
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is the probability that there is no gap greater than x for 
the remaining n events: P(x; n, 1 — t) = P(x/(1 — t); n). 
The recursion relation for < x < 1 is 



P{x;n + 1) 



dt(n + l)(l-t) n P 



(Al) 



It is convenient to distinguish between various pieces 
of the x range between and for it will turn out that 
P(x; n, n) takes on different forms in different pieces of 
that range. If x is in the range fj,/(m + 1) < x < /i/m, 
we say P(x;n,/J,) = P m (x;n, /i), and we say x is in the 
m'th range. Let us again restrict ourselves to fi = 1 and 
consider Eq. Al. If X is in the m'th range and, as in 
Eq. Al, < t < x, then x/(l — t) is in either range m or 
range (m — 1). The boundary between these two ranges 



is at x/(l 
becomes 



t) = 1/m; so t = 1 



For m > Eq. 



Al 



P m (x;n + 1) 
n + 1 



(A2) 

The appearance of m — 1 brings up the question of what 
happens if m = 0. Let us interpret the m = range to 
be the one with \/\ < x < 1/0 = oo. Since the empty 
space between events is certainly less than the length of 
the whole interval, Po(x;n) = 1. 
For to > it can be shown that 



(A3) 



fc=0 



In this equation, we interpret (£) as 



(2) 



r(n + l) 



fc!(n-fe)! r(fe + l)r(n - Jfe + 1) ' 



The gamma function is meaningful when analytically 
continued, in which case (?) is zero if k is an integer 
that is less than zero or greater than n. In P(x;0), the 
maximum (and only) gap is always 1; so Pq(x: 0) = 1 for 
x > 1, while for to > 0, when < x < 1, P m (x; 0) = 0. 



Since Eq. A3 is easily verified to be correct for all m > 
when n — 0, one may use induction with Eq. |A2| to prove 
Eq. A3 for all other n > 0. The simple but somewhat 



tedious manipulations of sums will not be given here, ex- 
cept for a useful identity in the induction step: 

C2) + (5U) = (* +1 )- 



It follows from Eq. A3 that 



P m (x; n, M ) = EM)" (1 - kx/») n . (A4) 



k=0 



Let us now compute Co, the probability for the max- 
imum empty space between events in (0, ji) being less 



than x given only that events are th row n according to 
a uniform unit density. Average Eq. A4 over a Poisson 
distribution with mean [i to get 



. .it 

C ° = E E e '^(~ 1 ) k C 1 - W/T: (A5) 



k=0 n=0 



which can be summed over n (again the manipulations 
will not be shown here) to give Eq. |[ 



APPENDIX B: PECULIARITIES OF C Max 

The function CMax(-9,/x) has certain peculiarities. For 
example, it cannot be defined for /i < 2.3026. Random 
experiments with /i < 2.3026 either give the largest pos- 
sible value of CMax, which occurs for zero events, with 
probability e _/i > 10%, or give smaller values with prob- 
ability 1 — e _Al < 90%. There is therefore no number, 
C l Max(-9, fj), for which there is exactly 90% probability 
°f C*Max < C < Max(-9, /i). No cross section resulting in 
fi < 2.3026 can be excluded to as high a confidence level 
as 90%. 

Another peculiarity of CMax(-9,^) is that it is not es- 
pecially smooth; it tends to increase rapidly near certain 
values of /i. To understand this behavior, note that for 
a given value of n, the maximum possible value of x is 
x = /i. Thus the maximum possible value over all x of 
C n (x,n) is C n (fi,fj,). If C n (fj,,(i) is less than C' Max (.9,/i) 
then intervals with n events cannot have Civiax = C n for 
that value of /i. Furthermore, since C„(x,/x) decreases 
with increasing n, intervals with m > n events also have 
C m < CMax- For low enough fi, only intervals with n = 
need be considered. In this case, the 90% confidence up- 
per limit for Cyiax occurs when x in Cq(x, /i) is equal to 
Xo(-9,/i), where xo{C,n) is the inverse of Cq(x,ia); it is 
defined as the value of x$ for which Co (&o , n) = C. Thus 
for low enough /i (but above 2.3026) 



C Max (-9, n) = C (x {.9,[i),n). 



(Bl) 



Co(cco(.9, /z), n) = 0.9 from the definitions of Co and xq. 
This formula for CMax breaks down as soon as /i is large 
enough to have Cx(/x, fx) > CMax(-9,/i), for at this value 
of fi it is possible for an interval with n = 1 to be CMax- In 
general, the threshold /i for intervals with n points being 
able to produce CMax for confidence level C is where 



C n (jjt,n) = C Max (C,fi). 



(B2) 



Every time a threshold in /i is passed that allows another 
value of n to participate in producing CMax, the value of 
Cviax(C, /i) spurts upward. 

If one considers all intervals with < n events, then 
the largest expected number of events is less than fi if 
and only if there are more than n events in the entire 
experimental range. Thus C n (p, /i) is the probability of 
> n events in the entire experimental range: C n (p, //) = 
P(/i,n + 1) of Eq. H This equation, with Eq. |B2| , can 
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TABLE I: Threshold /i for which intervals with > n events 
need not be considered when computing Civiax- 



n 


/i(n) 


H(n+1) 


n(n+2) 


/i(n+3) 


M (n+4) 





2.303 


3.890 


5.800 


7.491 


9.059 


5 


10.548 


12.009 


13.433 


14.824 


16.196 


10 


17.540 


18.891 


20.208 


21.520 


22.821 


15 


24.119 


25.400 


26.669 


27.926 


29.197 


20 


30.457 


31.690 


32.972 


34.203 


35.422 


25 


36.632 


37.849 


39.108 


40.333 


41.546 


30 


42.768 


43.978 


45.164 


46.351 


47.544 


35 


48.734 


49.944 


51.139 


52.314 


53.488 



be used to compute the thresholds in fi where n events 
first need to be included when trying to find C*Max in a 
calculation of the 90% confidence level. These thresholds 
are tabulated in table |. As an example of usage of this 
table, if you are evaluating Civiax for a 90% confidence 
level calculation with fj, = 20, you can ignore intervals 
with more than 11 events. 

The many rapid increases in CMax(-9, p) of Fig. 
when thresholds given in table | are crossed. 



occur 



APPENDIX C: PROBABILITY OF MORE 
EVENTS THAN OBSERVED IN AN INTERVAL 

Instead of using C n (x, fi) as a measure of how strongly 
a given interval with n events excludes a given cross sec- 
tion, one may use p n (x), the calculated Poisson proba- 
bility of there being more events in a random interval of 
that size than were actually observed. This probability 
is P(x, n + 1), as defined in Eq. ^. If p n is too large, then 
the cross section used in the calculation must have been 
too large. For a given cross section, find the interval that 
excludes the cross section most strongly; i.e., find the 
interval that gives the largest calculated probability of 
there being more events in the interval than were actu- 
ally observed. In other words, as was done with Cjyiax 
of the optimum interval method, define pMax to be the 
maximum over the p n for all possible intervals. If random 
experiments for the same given cross section would give 
a smaller pMax 90% of the time, then the cross section 
is rejected as too high with 90% confidence level. The 
function, PMax(C, m)j is defined as the PMax for which 
confidence level C is reached at the given fi. 

Although this method may not be as effective as the 
optimum gap method, it is much easier to calculate 
Pn(x) — P(x, n + 1) than it is to calculate C n (x, /i). 

Much of the reasoning applied to the optimum interval 
method applies here. As was the case for the optimum 
interval method, PMax(C, /i) depends only on C and /z, 
but not otherwise on the shape of the cross section. As for 
the optimum interval method, PMax(-9, /x) is not defined 
for ^ < 2.3026. For sufficiently low n above 2.3026 Eq. [Bl 
becomes 
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FIG. 5: Plot of piviax(.9,/i), the value of f>Max for which the 
90% confidence level is reached, as a function of the total 
number of events fi expected in the experimental range. 



TABLE II: Threshold p below which intervals with > n events 
need not be considered when computing p max for the 90% 
confidence level. 



n 




A*(n+1) 


M(n+2) 


M(n+3) 


M (n+4) 





2.303 


5.156 


7.584 


9.661 


11.599 


5 


13.427 


15.193 


16.900 


18.559 


20.176 


10 


21.771 


23.355 


24.880 


26.419 


27.922 


15 


29.428 


30.891 


32.359 


33.808 


35.251 


20 


36.701 


38.100 


39.519 


40.913 


42.317 


25 


43.700 


45.091 


46.465 


47.827 


49.193 


30 


50.561 


51.902 


53.255 


54.589 


55.926 


35 


57.264 


58.603 


59.920 


61.237 


62.549 


40 


63.868 


65.179 


66.478 


67.791 


69.080 



For the threshold [i at which intervals with n points be- 
com e a ble to contribute to PMax for confidence level C, 
Eq. B2 becomes 



P{H,n+ 1) =p„(/i) = J3Max(C', y). 



(C2) 



PMz 



t (.9,//)=poM.9,M)) = e-^- 9 ^. 



(CI) 



A Monte Carlo program was used to compute a table 
of PMax(0.9,/x) for n < 70, and the function is plotted in 
Fig. I 

Table || shows approximate values of the threshold \x 
calculated using Eq. |C^ with C = 0.9 for each n from 
to 44. The third digit of /j, does not really deserve to 
be trusted since pMax was computed from a Monte Carlo 
generated table. 

Appendix B explained why the value of CMax(-9,/^) 
spurts upward when fi crosses a threshold where inter- 
vals with more points can contribute to CMax- A much 
less obvious similar effect occurs with PMax(_-9, /i). No- 
tice the irregularity in the curve of Fig. ^| just after 
(i = 5.156, where n = 1 first begins to contribute. Be- 
tween /i = 2.3026 and /i = 5.156, Eq. CI applies, but af- 
ter fi — 5.156, PMax shoots above this form. The smaller 
irregularity above \i — 7.584, where n — 2 begins to con- 
tribute, is barely visible. 
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